Overview

Dataset Statistics

Number of Variables 17
Number of Rows 466
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 65.5 KB
Average Row Size in Memory 144.0 B
Variable Types
  • Numerical: 16
  • Categorical: 1

Dataset Insights

feature0 is skewed Skewed
feature1 is skewed Skewed
feature2 is skewed Skewed
feature3 is skewed Skewed
feature4 is skewed Skewed
feature5 is skewed Skewed
feature6 is skewed Skewed
feature7 is skewed Skewed
feature8 is skewed Skewed
feature9 is skewed Skewed
feature10 is skewed Skewed
feature11 is skewed Skewed
feature12 is skewed Skewed
feature13 is skewed Skewed
feature14 is skewed Skewed
feature15 is skewed Skewed
target has constant length 1 Constant Length
feature3 has 144 (30.9%) negatives Negatives
feature4 has 466 (100.0%) negatives Negatives
feature0 has 66 (14.16%) zeros Zeros
feature1 has 66 (14.16%) zeros Zeros
feature2 has 172 (36.91%) zeros Zeros
feature8 has 172 (36.91%) zeros Zeros
feature11 has 177 (37.98%) zeros Zeros
feature12 has 172 (36.91%) zeros Zeros
feature13 has 172 (36.91%) zeros Zeros
feature14 has 172 (36.91%) zeros Zeros
feature15 has 172 (36.91%) zeros Zeros
  • 1
  • 2
  • 3

Variables


feature0

numerical

Approximate Distinct Count 60
Approximate Unique (%) 12.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 438.7436
Minimum 0
Maximum 15400
Zeros 66
Zeros (%) 14.2%
Negatives 0
Negatives (%) 0.0%
  • feature0 is skewed right (γ1 = 9.4631)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 50
Median 150
Q3 500
95-th Percentile 1550
Maximum 15400
Range 15400
IQR 450

Descriptive Statistics

Mean 438.7436
Standard Deviation 984.5931
Variance 969423.504
Sum 204454.5
Skewness 9.4631
Kurtosis 124.4518
Coefficient of Variation 2.2441
  • feature0 is not normally distributed (p-value 3.359348030545878e-23)
  • feature0 has 44 outliers

feature1

numerical

Approximate Distinct Count 30
Approximate Unique (%) 6.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 4.8476
Minimum 0
Maximum 31
Zeros 66
Zeros (%) 14.2%
Negatives 0
Negatives (%) 0.0%
  • feature1 is skewed right (γ1 = 2.1986)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 2
Q3 6
95-th Percentile 21.75
Maximum 31
Range 31
IQR 5

Descriptive Statistics

Mean 4.8476
Standard Deviation 6.8367
Variance 46.7402
Sum 2259
Skewness 2.1986
Kurtosis 4.459
Coefficient of Variation 1.4103
  • feature1 is not normally distributed (p-value 4.9462305125484626e-17)
  • feature1 has 47 outliers

feature2

numerical

Approximate Distinct Count 291
Approximate Unique (%) 62.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 1244.3225
Minimum 0
Maximum 40291.24
Zeros 172
Zeros (%) 36.9%
Negatives 0
Negatives (%) 0.0%
  • feature2 is skewed right (γ1 = 6.9772)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 169.83
Q3 1017.375
95-th Percentile 5671.1625
Maximum 40291.24
Range 40291.24
IQR 1017.375

Descriptive Statistics

Mean 1244.3225
Standard Deviation 3558.699
Variance 1.2664e+07
Sum 579854.27
Skewness 6.9772
Kurtosis 63.6843
Coefficient of Variation 2.8599
  • feature2 is not normally distributed (p-value 2.7422216871500636e-24)
  • feature2 has 54 outliers

feature3

numerical

Approximate Distinct Count 458
Approximate Unique (%) 98.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 44.6009
Minimum -645.87
Maximum 1521.9
Zeros 4
Zeros (%) 0.9%
Negatives 144
Negatives (%) 30.9%
  • feature3 is skewed right (γ1 = 4.6758)

Quantile Statistics

Minimum -645.87
5-th Percentile -21.55
Q1 -0.4
Median 18.36
Q3 44.63
95-th Percentile 268.1275
Maximum 1521.9
Range 2167.77
IQR 45.03

Descriptive Statistics

Mean 44.6009
Standard Deviation 122.0935
Variance 14906.8264
Sum 20784.01
Skewness 4.6758
Kurtosis 50.7718
Coefficient of Variation 2.7375
  • feature3 is not normally distributed (p-value 2.6395004291803865e-18)
  • feature3 has 62 outliers

feature4

numerical

Approximate Distinct Count 440
Approximate Unique (%) 94.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean -434.2999
Minimum -15506.35
Maximum -0.26
Zeros 0
Zeros (%) 0.0%
Negatives 466
Negatives (%) 100.0%
  • feature4 is skewed left (γ1 = -9.7178)

Quantile Statistics

Minimum -15506.35
5-th Percentile -1561.1625
Q1 -492.035
Median -154.525
Q3 -50.18
95-th Percentile -4.0875
Maximum -0.26
Range 15506.09
IQR 441.855

Descriptive Statistics

Mean -434.2999
Standard Deviation 975.5552
Variance 951707.9447
Sum -202383.75
Skewness -9.7178
Kurtosis 131.2814
Coefficient of Variation -2.2463
  • feature4 is not normally distributed (p-value 1.4977058790469655e-23)
  • feature4 has 43 outliers

feature5

numerical

Approximate Distinct Count 343
Approximate Unique (%) 73.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 0.474
Minimum 0.15
Maximum 3.15
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • feature5 is skewed right (γ1 = 2.5745)

Quantile Statistics

Minimum 0.15
5-th Percentile 0.15
Q1 0.1737
Median 0.3039
Q3 0.5698
95-th Percentile 1.417
Maximum 3.15
Range 3
IQR 0.3962

Descriptive Statistics

Mean 0.474
Standard Deviation 0.4523
Variance 0.2045
Sum 220.8799
Skewness 2.5745
Kurtosis 7.9339
Coefficient of Variation 0.9542
  • feature5 is not normally distributed (p-value 6.651366626574512e-19)
  • feature5 has 35 outliers

feature6

numerical

Approximate Distinct Count 387
Approximate Unique (%) 83.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 979.0708
Minimum 1
Maximum 11731
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • feature6 is skewed right (γ1 = 3.5895)

Quantile Statistics

Minimum 1
5-th Percentile 23.25
Q1 133.5
Median 420
Q3 1238.75
95-th Percentile 3275.5
Maximum 11731
Range 11730
IQR 1105.25

Descriptive Statistics

Mean 979.0708
Standard Deviation 1460.7384
Variance 2.1338e+06
Sum 456247
Skewness 3.5895
Kurtosis 18.0076
Coefficient of Variation 1.492
  • feature6 is not normally distributed (p-value 5.885077909685861e-19)
  • feature6 has 33 outliers

feature7

numerical

Approximate Distinct Count 466
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 0.1139
Minimum 0.00066287
Maximum 40
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • feature7 is skewed right (γ1 = 20.8362)

Quantile Statistics

Minimum 0.00066287
5-th Percentile 0.001927
Q1 0.003318
Median 0.005127
Q3 0.009699
95-th Percentile 0.04011
Maximum 40
Range 39.9993
IQR 0.006381

Descriptive Statistics

Mean 0.1139
Standard Deviation 1.8737
Variance 3.5109
Sum 53.0702
Skewness 20.8362
Kurtosis 439.7368
Coefficient of Variation 16.453
  • feature7 is not normally distributed (p-value 4.228638259468358e-25)
  • feature7 has 48 outliers

feature8

numerical

Approximate Distinct Count 295
Approximate Unique (%) 63.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 3.857
Minimum 0
Maximum 281.6667
Zeros 172
Zeros (%) 36.9%
Negatives 0
Negatives (%) 0.0%
  • feature8 is skewed right (γ1 = 13.3282)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.9371
Q3 2.8158
95-th Percentile 12.7914
Maximum 281.6667
Range 281.6667
IQR 2.8158

Descriptive Statistics

Mean 3.857
Standard Deviation 15.6091
Variance 243.645
Sum 1797.3667
Skewness 13.3282
Kurtosis 219.2474
Coefficient of Variation 4.047
  • feature8 is not normally distributed (p-value 6.282164126168523e-25)
  • feature8 has 54 outliers

feature9

numerical

Approximate Distinct Count 465
Approximate Unique (%) 99.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 216437.824
Minimum 1
Maximum 3366472
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • feature9 is skewed right (γ1 = 3.7457)

Quantile Statistics

Minimum 1
5-th Percentile 1052.5
Q1 21131
Median 85328.5
Q3 264503.5
95-th Percentile 857257.75
Maximum 3366472
Range 3366471
IQR 243372.5

Descriptive Statistics

Mean 216437.824
Standard Deviation 350862.169
Variance 1.231e+11
Sum 1.0086e+08
Skewness 3.7457
Kurtosis 20.5036
Coefficient of Variation 1.6211
  • feature9 is not normally distributed (p-value 9.299652820838798e-22)
  • feature9 has 38 outliers

feature10

numerical

Approximate Distinct Count 446
Approximate Unique (%) 95.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 6930.4564
Minimum 0
Maximum 237182.78
Zeros 20
Zeros (%) 4.3%
Negatives 0
Negatives (%) 0.0%
  • feature10 is skewed right (γ1 = 6.8878)

Quantile Statistics

Minimum 0
5-th Percentile 51.715
Q1 383.6875
Median 1410.855
Q3 5212.9775
95-th Percentile 37783.3075
Maximum 237182.78
Range 237182.78
IQR 4829.29

Descriptive Statistics

Mean 6930.4564
Standard Deviation 17581.8008
Variance 3.0912e+08
Sum 3.2296e+06
Skewness 6.8878
Kurtosis 70.3254
Coefficient of Variation 2.5369
  • feature10 is not normally distributed (p-value 1.9259034441808673e-24)
  • feature10 has 61 outliers

feature11

numerical

Approximate Distinct Count 289
Approximate Unique (%) 62.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 0.4373
Minimum 0
Maximum 73.0806
Zeros 177
Zeros (%) 38.0%
Negatives 0
Negatives (%) 0.0%
  • feature11 is skewed right (γ1 = 20.2838)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.08251
Q3 0.3113
95-th Percentile 1.0281
Maximum 73.0806
Range 73.0806
IQR 0.3113

Descriptive Statistics

Mean 0.4373
Standard Deviation 3.4421
Variance 11.848
Sum 203.7974
Skewness 20.2838
Kurtosis 424.6204
Coefficient of Variation 7.8706
  • feature11 is not normally distributed (p-value 4.35066657266795e-25)
  • feature11 has 32 outliers

feature12

numerical

Approximate Distinct Count 295
Approximate Unique (%) 63.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 61.8862
Minimum 0
Maximum 2232.1
Zeros 172
Zeros (%) 36.9%
Negatives 0
Negatives (%) 0.0%
  • feature12 is skewed right (γ1 = 8.64)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 19.6926
Q3 65.4981
95-th Percentile 282.2087
Maximum 2232.1
Range 2232.1
IQR 65.4981

Descriptive Statistics

Mean 61.8862
Standard Deviation 142.5215
Variance 20312.3846
Sum 28838.9646
Skewness 8.64
Kurtosis 116.2135
Coefficient of Variation 2.303
  • feature12 is not normally distributed (p-value 1.38731992494556e-23)
  • feature12 has 45 outliers

feature13

numerical

Approximate Distinct Count 286
Approximate Unique (%) 61.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 0.008634
Minimum 0
Maximum 0.2046
Zeros 172
Zeros (%) 36.9%
Negatives 0
Negatives (%) 0.0%
  • feature13 is skewed right (γ1 = 5.3968)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.003207
Q3 0.009515
95-th Percentile 0.03505
Maximum 0.2046
Range 0.2046
IQR 0.009515

Descriptive Statistics

Mean 0.008634
Standard Deviation 0.01787
Variance 0.0003192
Sum 4.0235
Skewness 5.3968
Kurtosis 41.1411
Coefficient of Variation 2.0692
  • feature13 is not normally distributed (p-value 2.0176317441961577e-22)
  • feature13 has 38 outliers

feature14

numerical

Approximate Distinct Count 290
Approximate Unique (%) 62.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 87.7134
Minimum 0
Maximum 2154
Zeros 172
Zeros (%) 36.9%
Negatives 0
Negatives (%) 0.0%
  • feature14 is skewed right (γ1 = 7.923)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 72.5243
Q3 127.3015
95-th Percentile 231.6022
Maximum 2154
Range 2154
IQR 127.3015

Descriptive Statistics

Mean 87.7134
Standard Deviation 145.4264
Variance 21148.8486
Sum 40874.4257
Skewness 7.923
Kurtosis 95.5298
Coefficient of Variation 1.658
  • feature14 is not normally distributed (p-value 8.861510053414142e-17)
  • feature14 has 12 outliers

feature15

numerical

Approximate Distinct Count 56
Approximate Unique (%) 12.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.3 KB
Mean 10.3133
Minimum 0
Maximum 541
Zeros 172
Zeros (%) 36.9%
Negatives 0
Negatives (%) 0.0%
  • feature15 is skewed right (γ1 = 10.2825)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 2
Q3 8
95-th Percentile 42.5
Maximum 541
Range 541
IQR 8

Descriptive Statistics

Mean 10.3133
Standard Deviation 33.6252
Variance 1130.6543
Sum 4806
Skewness 10.2825
Kurtosis 141.8142
Coefficient of Variation 3.2604
  • feature15 is not normally distributed (p-value 1.2070625271292856e-24)
  • feature15 has 53 outliers

target

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 30.0 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 466
  • The top 2 categories (1, 0) take over 50.0%
  • target has words of constant length

Interactions

Correlations

Missing Values